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Abstract. Non-Gaussian noise transients in interferometric gravitational- wave 
detectors increase the background in searches for short-duration and un-modelled 

QQ signals. We describe a method for vetoing noise transients by ranking the statistical 

■^^ relationship between triggers in auxiliary channels that have negligible sensitivity to 

gravitational waves and putative gravitational- wave triggers in the detector output. 
The novelty of the algorithm lies in its hierarchical approach, which leads to a minimal 
set of veto conditions with high performance and low deadtime. After a given channel 
has been selected it is used to veto triggers from the detector output, then the algorithm 

^"H selects a new channel that performs well on the remaining triggers and the process is 

^ repeated. This method has been demonstrated to reduce the background in searches 

J^ for transient gravitational waves by the LIGO and Virgo collaborations. 

PACS numbers: 95.55.Ym, 04.80.Nn, 07.05.Kf 

1. Introduction 

The first generation of kilometer-scale interferometric gravitational- wave detectors, 
LIGO [1], Virgo [2] and GEO 600 [3], liave completed several years of network observation 
of the 40-10000 Hz frequency range. The data has been searched for various types 
of gravitational radiation including stochastic sources such as the early universe [4], 
continuous sources such as spinning neutron stars [5] , coalescence of binary systems of 
black holes or neutron stars [6] , and searches for un-modelled or poorly modelled bursts 
such as supernovae [7]. So far no gravitational- wave detection has been made, however 
data analysis is ongoing. The next generation of detectors including Advanced LIGO [8] 
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and Advanced Virgo [9] are expected to observe gravitational waves from compact binary 
coalescence [10] within this decade. 

Even with highly sensitive detectors, gravitational- wave searches are limited by 
noise. In addition data from all interferometric gravitational-wave detectors to date 
has shown a characteristic large non-Gaussian tail from non-astrophysical sources. In 
general, the shorter and less well-modelled a true signal is, the more difficult it is to 
distinguish from noise transients using signal processing. Requiring coincidence and 
coherence among multiple widely-separated detectors is an important and effective way 
to reduce the influence of transients. Still, the performance of searches for un-modelled 
bursts and high-mass binary coalescence signals (which have short duration in these 
detector's frequency band) is greatly diminished by transients in the detector data. 
This sets a practical limit on the sensitivity of the searches and on the false alarm rate 
that can be ascribed to candidate gravitational- wave signals. 

Interferometric gravitational-wave detectors are designed to be isolated from 
all signiflcant non- gravitational- wave external phenomena (seismic, electromagnetic, 
acoustic), and they are equipped with systems to monitor both the local environment 
and auxiliary interferometer channels for disturbances. In addition, there is a large effort 
to identify poor quality data and to link these to causes in the local environment or to 
aspects of the instrument itself [11, 12, 13] so that the noise transients can be removed 
through improvements to the instrument or by "vetoing" [14, 15, 16], whereby periods 
of demonstrated low-quality data are removed from an analysis. The method described 
in this paper and the "used percentage veto" described in [14] were the two methods 
most extensively used during the most recent science runs for both LIGO and Virgo. 

Figure 1 shows a cartoon example of strain data in the detector output channel, 
referred to hereafter as h(t). Here some periods of h(t) data are discarded because they 
are associated with hypothetical disturbances: high local wind speed and loud acoustic 
transients in a detector building. One useful indication of the effectiveness of a given 
veto is the ratio of the efficiency^ the percent of noise transients vetoed from h(t)^ to 
the deadtime^ the percent of the analysis time that is removed. If this ratio is high, the 
veto is considered useful. If the ratio is close to one, the veto performs no better than 
time removed at random. In this paper we refer to noise transients that are deemed 
signiflcant (i.e. by crossing some threshold) as triggers. In the hypothetical situation 
shown in Figure 1, three of four triggers in h(t) are removed at a cost of about 25% 
deadtime, giving a ratio of 3. Another indicator of veto utility is the percentage of an 
auxiliary channel's triggers that are coincident with a trigger in h(t)., the so-called use 
percentage. In Figure 1 the two triggers in the microphone channel each veto a trigger 
in h(t)., so its use percentage is 100%. 

In practice, auxiliary channels show excursions with a continuum of magnitudes, 
and there may be short time offsets between the disturbances and the triggers in h(t). 
Without a priori knowledge of what are the relevant amplitudes and time windows to 
use, optimizing the many parameters needed to deflne an effective set of vetoes is a 
challenging problem. 
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Figure 1. Illustration of the removal of some data from the h(t) channel due to 
its association with two hypothetical non-astrophysical disturbances, to obtain an 
improved data stream. The top trace, h(t)^ represents the h(t) data. The middle 
trace is a monitor of wind speeds on the detector site, while the lowest trace is a 
microphone located in one of the detector's buildings. The first and second vetoed 
period in h(t)^ between pairs of dashed lines, are removed due to association with 
sharp glitches in the microphone, while the third period is removed because of high 
local wind speeds. This data removal would be done after a relationship between these 
types of disturbances and noise transients in h(t) had been established. 



This paper describes a hierarchical veto algorithm called hveto^ used for the 
identification and removal of noise transients in searches for short-duration or poorly 
modelled gravitational waves. Its implementation for the LIGO and Virgo gravitational- 
wave detectors makes use of the hundreds of auxiliary channels recorded by each. 
It identifies the subset of channels that have negligible sensitivity to gravitational- 
wave signals, then determines which of these exhibit a significant relationship with 
transient noise present in h(t). Auxiliary channels are ranked based on their statistical 
significance, which quantifies how unlikely the number of time coincidences between 
triggers in h(t) and triggers in auxiliary channel are, in comparison with the number 
expected by chance based on Poisson statistics. 

The most novel feature of hveto is that it is hierarchical. The basic idea of using 
noise transients detected in auxiliary channels to veto putative signals has been around 
since the prototype era. One shortcoming of these approaches was that the significance 
of each channel was evaluated individually with respect to h(t)^ leading to many channels 
being adopted as vetoes even though they were largely vetoing the same set of triggers. 
Since not all channels have high use percentage, this led to unnecessarily high deadtime. 
The goal of the hierarchical method is to find a minimal set of veto conditions that 
collectively has a high efficiency and low deadtime by selecting the best available veto, 
removing its effects from h(t)^ and then iterating the process on the remaining triggers 
to produce only as many vetoes as have a statistically signiffcant effect. 
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Section 2 describes the statistic used by hveto to rank potential veto channels. 
Section 3 presents a flowchart and description of the hveto algorithm, and Section 4 
goes through a set of illustrative results from one week of LIGO data. 

2. Statistical significance 
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Figure 2. Illustration of how coincidence is determined. Triggers in h(t) are shown in 
the top trace. Triggers in the auxiliary channel, shown in the bottom trace, are assigned 
a time window. If an h(t) trigger falls within the time window around an auxiliary 
channel trigger, the triggers are considered coincident, here coincident auxiliary channel 
triggers are circled. 



The hierarchical veto algorithm looks for coincidence between two different types of 
triggers: putative gravitational-wave triggers from h(t) and non-astrophysical triggers 
from auxiliary channels. The auxiliary channels are ranked based on the significance of 
their relationship with h(t). First the number of time-coincidences between the triggers 
in a given channel and the triggers in h(t) are counted, as shown in Figure 2. Then 
the channels are ranked by a measure of how unlikely it is that their observed number 
of coincidences arose from the intersection of two random Poissonian time distributions 
with the same numbers of occurrences and time window. For each auxiliary channel, 
the significance is computed as:j:. 



S = -lo: 




P{fi,k) 



(1) 



where n is the number of coincidences found between noise transients in that channel and 
noise transients in h(t) during a given time Tfot^ and P(/i, k) is the Poisson probability 

:|: The value of Ppoi{fi^ k) is difficult to compute because its numerator is the factorial of a potentially 
large number. Alternatively the value can be computed using the incomplete gamma function. In our 
Matlab implementation, significance is calculated as, sig(n) = -loglO(gaminainc(inu,k, ^ lower O). 
When k is very large compared to fi the incomplete gamma function exceeds double precision limits 
and Matlab returns zero, resulting in infinite calculated S. In this case, we substitute a non-divergent 
approximation, YlV Ppoi{l^^k) ^ P{ii^k)^ which is implemented in Matlab as a sum of logarithms, 
sig(n) = -k*loglO(inu) + inu*loglO(exp(l)) + gaiiimaln(k+l)/log(10). 
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distribution function, 

P{p,k) = ^. (2) 

Here /i is the expected number of chance coincidences between random triggers in h(t) 
and in the auxihary channel, estimated by, 

M= 7^ . (3) 

-I- tot 

where N^ and Naux are the number of triggers in h(t) and auxihary channel, respectively, 
and Ty^in is the full width of the coincidence window used. 

Shortly put, significance is — log;LO ^^ the total probability of observing as many 
or more coincidences between two series of random occurrences than were actually 
observed § 

3. The hveto algorithm 

A fiowchart of the hveto algorithm is shown in Figure 3. In the pre-processing stage, 
inputs consisting of a configuration file^ a list of science segments^ and a set of trigger 
files are assembled. The configuration file contains all user-defined variables, including 
a detector name, a start and stop time defining the analysis period, a frequency range 
to consider, a list of thresholds on the signal-to-noise ratio (SNR) of triggers, a list of 
time- windows over which to check for coincidence, and a Significance Threshold\\ below 
which the search for further vetoes will terminate. The list of science segments defines 
the subset of the full analysis period when the detector was operating normally. The 
trigger files contain a list of noise transients parametrized by their, time, frequency, and 
SNR. There is one trigger file for h(t) and one for each auxiliary channel. Trigger files 
are produced beforehand by other algorithms, such as Kleinewelle^ Omega (formerly 
Q-pipeline) [17] and iHope [18], that identify short-duration candidate signals based on 
their excess power and/or likeness to signal models. 

Vetoing h(t) triggers based on a statistical relationship with auxiliary channels is 
only permissible if the auxiliary channels have negligible sensitivity to gravitational 
waves. Such channels are referred to as safe for use in defining vetoes. Some auxiliary 

§ A short numerical example is as follows. Consider a week of data, Ttot = 604800 s, during which the 
number of h(t) triggers is N^ = 1400 and the number of triggers in a particular auxiliary channel is 
^aux = 1100. If a time window of T^in = 0.10 s is used to check for coincidence, the expected number 
of coincidences is /i = 0.25. If the number of observed coincidences were one, the significance would 
be S* = 0.65, and if instead the number of coincidences observed were 30, the significance would be 
5' = 50. 

II The rate of noise transients in gravitational- wave data typically varies with time as conditions change. 
Therefore the mean rate of chance coincidences calculated from equation 2 may not be accurate. To 
account for this and other errors, a significance threshold can be determined empirically by using 
time-slide analysis (e.g., adding a seconds- long artificial offset to the central times of h(t) triggers) to 
calculate the significance typical of the coincidences between auxiliary channels and h(t) when there is 
no causal relationship. Significance of up to 5 is often observed. 



Hierarchical veto 



START 



^Science Segments , 



C/) 
C/) 
CD 
O 
O 
L_ 

Q_ 
I 

CD 

1— 

Q_ 



(/) 

O 
O 



O 
CL 




Remove triggers outside science 
segments 



I 



Configuration 

Detector: LIGO Livingston 

Time Range: [959126400-959731200] sec 

Frequency range: [32-4098] Hz 

SNR Thresholds: [10, 12, 15, 20, 40, 100, 300] 

Time Windows: [0.1, 0.2, 0.4, 0.8, 1.0] sec 

Significance Threshold: 15 

Safe Channel List 
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Figure 3. Flowcliart of tlie liveto algoritlim described in tlie text. Tlie configuration 
values are tiiose used for Section 4. 



channels, particularly those measuring degrees of freedom of the interferometer, are 
weakly coupled to h(t) and thus demonstrate a non-negligible response to gravitational- 
wave strain. During data taking runs, signals that mimic the behavior of gravitational 
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waves are injected into the instruments by physically moving the test masses to make 
a time- varying strain signal. Many of these hardware injections are performed at 
intentionally high SNR in the detectors to verify the safety of auxiliary channels; for a 
channel to be considered safe, it should not respond to hardware injections. To evaluate 
this, prior to the process outlined in Figure 3, hveto compares a list of hardware injection 
times to auxiliary channel triggers, counts the number of coincidences within a 100 ms 
time window, and computes the significance of the coincidences in each channel. Any 
channel that is assigned a significance of greater than 5^=3 (conservative) is considered 
unsafe for use in veto production and removed %. 

The Process stage of hveto proceeds in rounds. In the first round, a set of all 
possible veto conditions (all combinations of auxiliary channels, time windows, and 
SNR thresholds given in the configuration) is created. Their times are compared 
to those in the h(t) channel, coincidences are counted as described above, and the 
statistical significance of each channel is computed following Equation 1. The one 
channel/window/SNR combination with the highest significance is declared the winner 
of the first round. 

If the significance of the round winner is greater than the Significance Threshold, 
its trigger times are turned into veto segments by adding and subtracting half the time 
window width. The first round ends when all of the winner's veto segments are applied, 
removing those times from the segments over which h(t) and all auxiliary channels are 
further analyzed. 

The second round proceeds exactly as the first, by evaluating the statistical 
significance of each veto condition on the remaining h(t) triggers and determining a 
winner. However, now the noise transients related to the winning channel from the first 
round have been vetoed. So the winner of the second round will be diflFerent than that 
of the first, and is likely to relate to a different noise mechanism. This is the key to the 
hierarchical operation of hveto. 

The rounds continue until the significance of a round-winning combination does 
not exceed the Significance Threshold. The result of the entire process is a short list of 
veto conditions, usually less than a dozen, that collectively have a high efficiency and 
low deadtime. 

Finally, in the Post-processing stage the cumulative effect of all of the veto segments 
produced is assessed, statistics and plots (some of which are presented in the next 
section) are generated, and all of this information is posted to a web page. 

4. Illustrative results 

This section illustrates the utility of the hveto algorithm through application to one week 
of data from the LIGO Livingston Observatory between May 29 and June 4 2010. For 
more information on the LIGO systems mentioned here, see [1]. The h(t) and auxiliary 

% This value of significance was chosen empirically to reject channels with known low-level coupling, 
without too many false rejections. 
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channel triggers were generated with KleineweUe. The algorithm completed a total of 11 
rounds before reaching the Significance Threshold of 15. The results indicated a variety 
of glitch mechanisms present in the detector. Two of the round winners, as well as the 
cumulative effects of the vetoes, are described below. 

The second round was won by the channel ASC-ITMX_P, hereafter ITMX Pitchy 
which represents the pitch angular motion of the input-coupler optic for the long Fabry- 
Perot Cavity in the X-arm of the interferometer. Figure 4 shows the SNR of the triggers 
in this channel versus time. Also indicated is the subset of triggers, 548 of the total 
1947, that were coincident with triggers in h(t). The complete set of triggers were used 
to construct veto segments. Figure 5 shows the SNR of the triggers in h(t) versus time, 
and the subset, 552 of 2354, of these that were vetoed by the ITMX Pitch veto. This 
veto had a high 23% efficiency and low 0.052% deadtime. 



10^ 



00 



SNR vs. Time: Detector=Ll, Round=2, Winner=Ll_ASC-ITMX_P_8_256 
Times offset of GPS=959126400, UTC=20 10-05-28 23:59:45 



SNR vs. Time: Detector=Ll, Round=2, Winner=Ll_ASC-ITMX_P_8_256 
Times offset of GPS=959 126400, UTC=20 10-05-28 23:59:45 
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Figure 4. SNR versus time for 
noise transients in the ITMX pitch 
channel. The subset of these 
coincident with a transient in h(t) 
are marked with blue plus signs. 



Figure 5. SNR versus time for 
noise transients in h(t). The subset 
of these vetoed by the ITMX pitch 
channel are marked with red plus 
signs. 



A novel feature of hveto is its "significance drop plot" . This is useful for indicating 
whether any other channels than the round winner were sensitive to a population of 
triggers vetoed in a given round. It is produced by plotting the highest significance for 
each channel over all choices of threshold and time window in a given round as one end 
point of a vertical line, and then plotting the highest significance value for the following 
round as the other end point. A drop in significance for a channel from one round to 
the next indicates that some of its coincidences are the same as those vetoed by the 
round winner. If the significance stays the same (or increases, which can happen on 
occasion when the veto leads to a small decrease in overall analysis time), the channel 
has no relationship with the disturbance vetoed by the round winner. Typically the 
winning channel for a given round will have a sharp decrease in significance, indicated 
by a tall line. Note however that the significance of a winning channel does not always 
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Figure 6. Significance drop plot for the second round, won by ITMX Pitch. 
Each vertical line has two endpoints corresponding to the highest significance for its 
corresponding channel in a given round, and the following round, respectively. Blue 
lines indicate that the significance of that channel is lower in the later round, and red 
lines indicate the significance is the same or higher. A subset of instrumental sensing 
and control channels are shown. From left to right channel prefixes and their general 
description are ASC: alignment, 100: frequency, LSC: length, OMC: output mode 
cleaner and SUS: suspensions. 



drop identically to zero in the following round, because the same channel may still have 
some coincidences at a lower SNR and/or wider time window. 

Significance drop plots are useful for identifying "families" of channels that are all 
sensitive to the same type of disturbance. This can provide more information about the 
true origin of a disturbance, for example by localizing it to channels associated with a 
given building or subsystem. Figure 6 shows the drop plot for the round won by ITMX 
Pitch. The first 24 channels from left to right across the x-axis correspond to alignment 
sensing and control systems. The largest significance drop is for the round winner, but 
the next seven largest significance drops are all for other channels in the interferometer 
that sense pitch alignment. The corresponding channels for yaw motion (for example 
ASC-ITMX_Y) show little or no significance drop. This indicates that the underlying 
disturbance was nearly entirely in pitch, and was sensed throughout the interferometer. 

Note that a non-hierarchical veto method based on significance would apply vetoes 
from eight different pitch alignment sensors here, each with imperfect use percentage and 
therefore additional deadtime, before applying a veto related to a different disturbance. 
In hveto, once the ITMX Pitch veto segments are applied, the significance of all pitch- 
related alignment channels drops considerably and none is selected again until ITMY 
Pitch in the tenth round. 

As a second example, the sixth round was won by the channel SUS-ETMY_SENSOR_SIDE, 
hereafter Side Sensor^ an optical monitor of the side-to-side position motion of the end 
test mass suspension of the Y-arm. Figure 7 shows the SNR of the triggers in the Side 
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Sensor channel versus time, and the subset of these triggers that coincided with a trigger 
in h(t). The ghtches, which were thought to have been due to a digital issue, repeated 
throughout the week with an SNR of just above 100. Of the 34 total triggers at this 
SNR threshold, 33 vetoed a trigger in h(t)^ giving a use percentage of 97%, indicating 
a highly selective veto. Figure 8 shows the SNR of the triggers in h(t) versus time, and 
the subset that are vetoed by the Side Sensor channel. The combined information in 
these figures suggests that a population of noise transients with SNR of around 100 in 
the Side Sensor correspond to triggers in h(t) with an SNR of 20-30. The fact that the 
transients in the Side Sensor have larger SNR than the corresponding transients in h(t) 
is consistent with a causal coupling from the Side Sensor. 



SNR vs. Time: Detector=Ll, Round=6, Winner=Ll_SUS-ETMY_SENSOR_SIDE_8_256 
Times offset of GPS=9591 26400, UTC=2010-05-28 23:59:45 
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SNR vs. Time: Detector=Ll, Round=6, Winner=Ll_SUS-ETMY_SENSOR_SIDE_8_256 
Times offset of GPS=959 126400, UTC=20 10-05-28 23:59:45 
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Figure 7. SNR versus time for 
noise transients in the Side Sensor 
channel. The subset of these 
coincident with a transient in h(t) 
are marked with blue plus signs. 



Figure 8. SNR versus time for 
noise transients in h(t). The subset 
of these vetoed by the Side Sensor 
channel are marked with red plus 
signs. 



Figure 9 shows the drop plot for the round won by the Side Sensor. In contrast 
to the previous example, the Side Sensor is the only channel that experiences a large 
significance drop. This indicates that the Side Sensor was acting alone - the population 
of glitches it sensed were seen by no other auxiliary channels. 

The previous two examples illustrate the ability of hveto to distinguish between 
different classes of disturbances, by identifying the families of channels that sense them 
and the degree to which they couple to h(t). Figure 10 gives an indication of the variety 
of different glitch classes that hveto can identify. Here the starting set of h(t) triggers 
are shown with black circles, and the vetoed triggers through the first six rounds are 
shown with colored markers. Qualitatively the disturbances range from well-localized 
in time or in SNR to more isotropically distributed. 

Finally, an overview of the effectiveness of all vetoes can be seen in a plot of the 
cumulative efficiency versus cumulative deadtime, as shown in Figure 11. During this 
week 60% of h(t) triggers are vetoed at a cost of only 0.85% deadtime, for a ratio 71. 
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Figure 9. Significance drop plot for the sixth round, won by the Side Sensor. Here 
no other channels show a relationship with the vetoed disturbance. 
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Figure 10. SNR versus time 
for noise transients in h(t) (black), 
and the subset of these vetoed in 
each of the first six rounds (colored 
markers). 



Figure 11. Efficiency versus 
deadtime for all rounds. Each 
round is marked with a circle (the 
initial condition of zero deadtime 
and efficiency is also marked) . The 
slope of each line segment is the 
efficiency to deadtime ratio for that 
round. 



Note that the efficiency to deadtime ratio for each round is always much greater than 
one, indicating very effective vetoes. This ratio decreases overall as the rounds increase. 
However the slope for each successive round does not always decrease, because although 
the statistical significance is related to the efficiency to deadtime ratio, they are not 
directly proportional. 
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5. Summary and discussion 

The algorithm described here identifies statistical relationships between putative 
gravitational-wave triggers in h(t) and triggers in auxiliary channels that have no 
astrophysical sensitivity It operates hierarchically to produce a minimal set of veto 
conditions that have a high eflSciency to deadtime ratio. Once a veto condition is 
identified, its effects are removed, and new veto conditions are sought out using only 
the remaining triggers, revealing potentially different glitch mechanisms. In addition 
to vetoes, the algorithm produces information such as drop plots that can be used to 
identify families of channels that sense the same disturbances. 

Although this paper described illustrative results from only one detector, this 
algorithm has been used on all LIGO and Virgo detectors and those results will be 
presented in collaboration papers. It has proved useful for LIGO and Virgo science 
by producing vetoes that reduced the background in gravitational-wave analyses and 
providing hints used to develop data quality fiags and to improve the detectors. This 
and complementary methods will reduce the impact of non-Gaussian noise on searches 
for gravitational waves with Advanced LIGO [8] and Advanced Virgo [9]. 
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